A Supplementary Material A.1 AMWC Heuristic

Neural Information Processing Systems

Whenever a merge operation is performed, the corresponding edge is contracted and new edges can potentially be created (Lines 7-17). Afterwards, the clusters belonging to the non-partitionable class (i.e., stuff) are merged. Table 2 contains the hyperparameters used for fully differentiable training. We can see that optimizing the PQ surrogate gives better performance, and that using separate losses decreases the performance, especially on 'thing' classes. Table 5 shows that all trials improve over the baseline with fully differentiable training.
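The merge step described above can be illustrated with a small sketch. This is a hypothetical toy example, not the authors' implementation: contracting an edge absorbs one cluster into another, and edges to shared neighbours are combined into a single new edge.

```python
# Hypothetical sketch of one merge operation: contracting the edge (a, b)
# absorbs cluster b into cluster a; parallel edges that arise are summed.

def merge_clusters(edges, a, b):
    """Contract the edge (a, b) in an edge-weight map.

    `edges` maps frozenset({u, v}) -> weight. Edges incident to b are
    redirected to a; when a and b shared a neighbour, a combined edge
    is effectively created by summing the two weights.
    """
    merged = {}
    for uv, w in edges.items():
        u, v = tuple(uv)
        u = a if u == b else u
        v = a if v == b else v
        if u == v:  # the contracted edge itself disappears
            continue
        key = frozenset({u, v})
        merged[key] = merged.get(key, 0.0) + w
    return merged

edges = {frozenset({1, 2}): 0.9, frozenset({2, 3}): 0.4, frozenset({1, 3}): 0.2}
edges = merge_clusters(edges, 1, 2)  # contract (1, 2); edges to node 3 combine
print(edges)  # only the combined edge {1, 3} remains, weight 0.4 + 0.2
```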


PolarMix Supplemental Material

Neural Information Processing Systems

We first implement global augmentation approaches, including random rotation and random scaling, on two LiDAR scans separately and then concatenate them for training. As shown in the '1, 2, 3' rows of the table, segmentation performance improves with the number of copies, which indicates the effectiveness of the approach in enriching the data distribution. In this section, we conduct experiments to analyze how PolarMix benefits LiDAR point cloud learning. As a comparison, PolarMix is more robust to the instance spatial location, without much performance drop. PolarMix clearly improves the robustness of the baseline with respect to the angular variations of instances.
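The baseline augmentation described above can be sketched as follows. This is a minimal illustrative example, not the official PolarMix code: each scan receives an independent random yaw rotation and global scaling, and the augmented copies are then concatenated into one training sample.

```python
# Illustrative sketch: independently rotate and scale two LiDAR scans,
# then concatenate them for training (baseline global augmentation).
import numpy as np

def augment(points, rng):
    """points: (N, 3) array of x, y, z coordinates."""
    theta = rng.uniform(0, 2 * np.pi)   # random yaw rotation about z
    scale = rng.uniform(0.95, 1.05)     # random global scaling
    c, s = np.cos(theta), np.sin(theta)
    rot = np.array([[c, -s, 0.0],
                    [s,  c, 0.0],
                    [0.0, 0.0, 1.0]])
    return scale * points @ rot.T

rng = np.random.default_rng(0)
scan_a = rng.normal(size=(1000, 3))  # stand-in for a real LiDAR scan
scan_b = rng.normal(size=(800, 3))
mixed = np.concatenate([augment(scan_a, rng), augment(scan_b, rng)], axis=0)
print(mixed.shape)  # (1800, 3)
```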


Object segmentation from common fate: Motion energy processing enables human-like zero-shot generalization to random dot stimuli

Neural Information Processing Systems

Humans excel at detecting and segmenting moving objects according to the Gestalt principle of "common fate". Remarkably, previous works have shown that human perception generalizes this principle in a zero-shot fashion to unseen textures or random dots. In this work, we seek to better understand the computational basis for this capability by evaluating a broad range of optical flow models and a neuroscience-inspired motion energy model for zero-shot figure-ground segmentation of random dot stimuli. Specifically, we use the extensively validated motion energy model proposed by Simoncelli and Heeger in 1998, which is fitted to neural recordings in cortex area MT. We find that a cross section of 40 deep optical flow models trained on different datasets struggle to estimate motion patterns in random dot videos, resulting in poor figure-ground segmentation performance. Conversely, the neuroscience-inspired model significantly outperforms all optical flow models on this task. For a direct comparison to human perception, we conduct a psychophysical study using a shape identification task as a proxy to measure human segmentation performance. All state-of-the-art optical flow models fall short of human performance, but only the motion energy model matches human capability.


A Surprisingly Simple Approach to Generalized Few-Shot Semantic Segmentation

Neural Information Processing Systems

The goal of *generalized* few-shot semantic segmentation (GFSS) is to recognize *novel-class* objects through training with a few annotated examples and the *base-class* model that learned the knowledge about the base classes. Unlike classic few-shot semantic segmentation, GFSS aims to classify pixels into both base and novel classes, making it a more practical setting. Current GFSS methods rely on several techniques, such as combinations of customized modules, carefully designed loss functions, meta-learning, and transductive learning. However, we found that a simple rule and standard supervised learning substantially improve GFSS performance. In this paper, we propose a simple yet effective method for GFSS that does not use the techniques mentioned above. Also, we theoretically show that our method perfectly maintains the segmentation performance of the base-class model over most of the base classes. Through numerical experiments, we demonstrated the effectiveness of our method. It improved novel-class segmentation performance in the $1$-shot scenario by $6.1$% on the PASCAL-$5^i$ dataset, $4.7$% on the PASCAL-$10^i$ dataset, and $1.0$% on the COCO-$20^i$ dataset. Our code is publicly available at https://github.com/IBM/BCM.


Panoramic Out-of-Distribution Segmentation

Duan, Mengfei, Zhang, Yuheng, Cao, Yihong, Teng, Fei, Luo, Kai, Zhang, Jiaming, Yang, Kailun, Li, Zhiyong

arXiv.org Artificial Intelligence

Panoramic imaging enables capturing 360° images with an ultra-wide Field-of-View (FoV) for dense omnidirectional perception, which is critical to applications such as autonomous driving and augmented reality. However, current panoramic semantic segmentation methods fail to identify outliers, and pinhole Out-of-distribution Segmentation (OoS) models perform unsatisfactorily in the panoramic domain due to pixel distortions and background clutter. To address these issues, we introduce a new task, Panoramic Out-of-distribution Segmentation (PanOoS), with the aim of achieving comprehensive and safe scene understanding. Furthermore, we propose the first solution, POS, which adapts to the characteristics of panoramic images through text-guided prompt distribution learning. Specifically, POS integrates a disentanglement strategy designed to materialize the cross-domain generalization capability of CLIP. The proposed Prompt-based Restoration Attention (PRA) optimizes semantic decoding by prompt guidance and self-adaptive correction, while Bilevel Prompt Distribution Learning (BPDL) refines the manifold of per-pixel mask embeddings via semantic prototype supervision. Besides, to compensate for the scarcity of PanOoS datasets, we establish two benchmarks: DenseOoS, which features diverse outliers in complex environments, and QuadOoS, captured by a quadruped robot with a panoramic annular lens system. Extensive experiments demonstrate the superior performance of POS, with AuPRC improving by 34.25% and FPR95 decreasing by 21.42% on DenseOoS, outperforming state-of-the-art pinhole-OoS methods. Moreover, POS achieves leading closed-set segmentation capabilities and advances the development of panoramic understanding. Code and datasets will be available at https://github.com/MengfeiD/PanOoS.